Adjustable deterministic pseudonymization of speech
نویسندگان
چکیده
While public speech resources become increasingly available, there is a growing interest to preserve the privacy of speakers, through methods that anonymize speaker information from while preserving spoken linguistic content. In this paper, method for pseudonymization (reversible anonymization) presented, allows obfuscate identity in untranscribed running speech. The approach manipulates spectro-temporal structure simulate different length and vocal tract by modifying formant locations, as well altering pitch speaking rate. deterministic partially reversible, changes are adjustable on continuous scale. has been evaluated terms (i) ABX listening experiments, (ii) automatic verification recognition. experimental results indicate identifiability among forced choice pairs reduced over 90% less than 70% pseudonymization, de-pseudonymization was effective. An evaluation VoicePrivacy 2020 challenge data showed proposed performs better signal processing based baseline uses McAdams coefficient slightly worse neural source filtering method. Further analysis approach: comparable phone posterior feature objective intelligibility measure, preserves tracks method, (iii) paralinguistic aspects such dysarthria several speakers.
منابع مشابه
Implementation and analysis of NFAs with adjustable deterministic blow-up
Non-deterministic and deterministic automata both have the expressive power to recognize regular languages; thus, for any non-deterministic automaton there exists an equivalent deterministic automaton that recognizes the same regular language (and vice versa). Depending on the regular language recognized, however , the number of states in minimal NFAs and DFAs recognizing the language may diffe...
متن کاملCircumventing IP-address pseudonymization
This paper presents an attack that circumvents anonymization of IP addresses in IP network traffic data in O(n) time, or O(n) time under certain circumstances. The attack is based on packet injection, and circumvents all anonymization techniques that assign a static and unique pseudonym to an IP address. It turns out that the packet injection itself, as well as the extraction of the correspondi...
متن کاملthe effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension
کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...
15 صفحه اولImproving Patients Privacy with Pseudonymization
e-Health requires the sharing of patient related data when and where necessary. Electronic health records promise to improve communication between health care providers, thus leading to better quality of patients' treatment and reduced costs. As highly sensitive patient information provides a promising goal (e.g., for attackers), there is an increasing social and political pressure to guarantee...
متن کاملPseudonymization of patient identifiers for translational research
BACKGROUND The usage of patient data for research poses risks concerning the patients' privacy and informational self-determination. Next-generation-sequencing technologies and various other methods gain data from biospecimen, both for translational research and personalized medicine. If these biospecimen are anonymized, individual research results from genomic research, which should be offered...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Speech & Language
سال: 2022
ISSN: ['1095-8363', '0885-2308']
DOI: https://doi.org/10.1016/j.csl.2021.101284